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Abstract 

Boosted resonances is a highly probable and enthusiastic scenario in any process probing the 
electroweak scale. Such objects when decaying into jets can easily blend with the cornucopia 
of jets from hard relative light QCD states. We review jet observables and algorithms that can 
contribute to the identification of highly boosted heavy jets and the possible searches that can 
make use of such substructure information. We also review previous studies by CDF on boosted 
jets and its measurements on specific jet shapes. 



I. INTRODUCTION 



Although the idea of looking inside a jet and study the radiation pattern of its constituents 
is not new [T], the large potential for searches of new electroweak scale particles has only 
been appreciated recently [2]. At the LHC with its 14 TeV center of mass energy, particles 
with masses around the electroweak scale are frequently produced beyond threshold, i.e. 
boosted transverse to the beam direction. Either because they recoil against other energetic 
objects or because they arise from decays of even heavier particles, e.g. Z' or i^i^-gluons. If 
the resonance's transverse momentum is bigger than their mass, their decay products tend 
to be collimated in the lab frame. 

Jets, collinear sprays of hadrons and the hadrons' decay products, are the most frequent 
objects at the LHC. They do not only consist of final state radiation (FSR) but also of 
radiation from the Underlying Event (UE) and pileup. Therefore, it is important to be able 
to disentangle the rare events with boosted resonances from the large QCD background based 
on their different radiation patterns. Many different approaches have been developed to 
exploit these differences and will be the focus of this review. In the subsequent introduction 
we describe how jets are defined and the types of jets we are interested in, and how to 
properly understand the background in these kinematic regimes. In Section [IT] we describe 
the different methods used to tag boosted jets through algorithms and observables that arise 



from the properties of the signal and background. In Section IV we describe a recent search 
at the Tevatron for boosted top jets and measurements of properties of highly boosted jets. 



A. Heavy Jets 

It is not unambiguous what to call a jet in an event. To be able to compare experimental 
results with theoretical predictions, jets have to be defined in an infrared safe way. Therefore, 
IR safe sequential jet algorithms became increasingly popular over the last years. These 
algorithms sequentially merge (by combining their four-vectors) the pair of particles that 
are closest according to some distance measure dy unless there is a distance diB (so-called 
beam distance) which is smaller than all d^, in which case particle i is called a jet and the 
algorithm proceeds with the remaining particles in the event. The most popular sequential 
jet algorithms are the kr [31 H] , the Cambridge/Aachen (CA) [HIE] and anti-fcr algorithm 
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[7J. Their measure dij is denned by 
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k,T : n=l , 

C/A: n=0, (1) 

anti— kx '■ n=-l, 



and diB = Vr'i- Here, R is the jet resolution parameter which specifies the size of the jet. 
These algorithms do not only ensure the comparability of Monte-Carlo (MC) calculations 
with experimental findings but by constructing the jet sequentially they also provide a 
recombination history of the jets' constituents. This history is attached to every jet in 
the event and allows to examine the jet at different resolution angles and intrinsic scales. 
Utilities for clustering and for studying the clustering history are available in Fastjet [H] and 
Spartyjet [9]. 

Different scenarios have been identified where jet substructure techniques are of value to 
improve phenomenological analysis. They can be cast into four different classes characterized 
by the resonance's pt and the overall business of the event. 

large p T , non-busy final state: If the event is generated with a large invariant mass and 
the number of physical objects is small, e.g. a heavy TeV scale resonance decays into 
two electroweak scale resonances (pp — > Xx e v — > 2 Yew — ► jets) ? the electroweak scale 
objects Yew tend to be highly boosted, their decay products are highly collimated 
and the radiation off the two Y E w are well separated with respect to each other. Due 
to Pt,y 3> m Y the fat jet's cone size does not need to be very large (R ~ 0.8) to 
catch all the necessary FSR to reconstruct the boosted resonances. Because of the 
relatively small cone size and the absence of many sources of hard radiation in the 
event, the effect of UE, pileup, and ISR is less pronounced and one finds rrij ~ my. The 
importance of jet grooming methods to remove uncorrelated soft radiation is reduced 
but jet substructure methods are necessary to discriminate between a resonance jet 
and a QCD jet. 

large p?, busy final state: SUSY cascade decays are examples where, depending on the 
mass splitting of the SUSY particles in the decay chain, the boosted resonances can 
have large transverse momentum accompanied by many hard jets: pp — » 2 Xx e v — ► 
Yew + jets — >■ jets. Considering UE, pileup, and ISR, the situation is similar to the 



non-busy final state, but the jet substructure methods might need to be adjusted to 
the many additional sources of hard uncorrelated radiation in the fat jet. 

medium pr, non-busy final state: If two electroweak scale resonances are directly pro- 
duced from proton collisions, pp —> Y EW Yg W , they are usually produced around thresh- 
old, e.g. pp — » HW . Only a small fraction of the events yield px,y ~ m Y- Still, 
focusing on this fraction of events can be a superior way to disentangle the signal 
from the backgrounds. Phenomenological studies [T0l - fT5] have shown that, because of 
the kinematic features, event reconstruction efficiencies, b-tagging efficiencies [TT| [12] 
and the jet energy resolution can be improved and that combinatorial problems in 
the identification of the decay products of Yew are reduced. To make as many signal 
events accessible to jet substructure methods as possible the fat jet's cone size has to 
be much bigger than in the highly boosted scenarios. Therefore, UE, pileup, and ISR 
affect the fat jet's mass much stronger and one should consider using techniques to 
remove uncorrelated soft radiation, e.g. jet grooming methods j2]. 

medium p?, busy final state: Events with many sources of hard radiation and less col- 
limated decay products of the Yew are the most difficult scenarios, irrespective if jet 
substructure techniques are applied or not. Although many of the advantages outlined 
for the medium pt non-busy final states can be carried over, combinatorial problems 
to discriminate between decay products and hard radiation from other sources in the 
fat jet can occur much more frequently. Taggers for Yew have to take that into ac- 
count and combine jet grooming techniques with criteria to discriminate Y EW decay 
products from other hard radiation. 

B. Light QCD Jets 

The best way to understand how to distinguish heavy jet events initiated by tops, or other 
heavy states from the overwhelming light QCD jet background, i.e., jets initiated by hard 
light quarks and gluons, is to understand the behavior of jets in the kinematical regions 
of interest. In particular since we are dealing with high-p^ jets, these should naturally 
have small R, or "cone" sizes, and thus allows to understand its analytical properties of its 
distributions versatilely. 



4 



A particular useful feature of the hadronic jet cross-sections which we can exploit is its 
factorization properties. Factorization theorems provide us with a framework with which 
to compute such cross sections. They allow to us to systematically separate long distance 
effects from short distance physics where perturbative QCD is applicable, i.e., when there 
is a large momentum transfer. Long distance effects are associated with the infrared regime 
of the theory, and thus are not perturbatively calculable. Together with a infrared-safe 
definition for the jets, the cross-section for n-jet production factorizes into the following 
decomposition, 

dVHAHB^-JnjRdef) _ f , , f H A(r x fH B( x 

j i * » i abed 

xH lLed fad, %b, {P%VJi,Rdef}) Sjj ({m 2 j. P%VJi, Rdef}) 

fljt\ml,pl, V ^R def ). (2) 



n 

X 

i=l 



where Rdef are the parameters associated with the particular jet definition, such as the cone 
sizes or transverse momentum parameters. 

The functions f^' B are the parton distribution functions for partons a, b in hadrons Ha,b- 
Ji are the jet functions, as the name implies are functions of only variables associatled to 
the particular Jet and its definition. 

The function Sjj represents the soft degrees freedom and describe wide angle radiation 
which correlates the final states, and thus also depends on the jet definition and jet observ- 
ables, but depend only in the color structure of the states involved in the hard scattering. 
The function H IJ describes the underlying hard perturbative process that produces the 
states that initiate the hadronic jets. It can be obtained from a partonic cross-section with 
the appropriate subtractions, the exact relations depends on the definition of the other 
factorized functions. 

The jet function for quarks at fixed jet mass is defined by the following matrix element, 

2 ^y W c J2 Tr {^<0|^(°> -00)9(0)1^)^19(0)^(0, -oo)|0)} 

xS (m] - rh 2 (N Jt )))5^(n - n(7V J J)5(p° J - u(N Jc ) (3) 
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while gluon jets are defined as, 




x5 (m 2 j - m*(JV> 4 )))«y< 2 >(n - n(iV Ji ))5(p° J - u(N Jc ). 



(4) 



They absorb collinear enhancements to the the outgoing particles in the underlying hard 
perturbative process. $s are path ordered exponentials(Wilson lines) defined by 



where £ is a direction with at least one component in the opposite direction of the jet, the 
hadronic cross-section is independent of this direction. The jet function is normalized such 
that at lowest order its only contribution is, 



Contributions to the jet mass from the soft function only start contributing at 0(R 2 ). Thus 
in the case of small jets, R < 1, most of the contribution are associated with the jet function 
whose behavior starts 0(logR 2 ). Note that the Soft and Jet functions are the only ones 
that depend on the particulars of the jet definition and thus contain all the substructure 
information. 

The rest of this discussion will deal only with high pt jets, since these are the jets that 
will have enough jet mass to be indiscernible from heavy jets initiated by tops or other 
electroweak processes, and thus are associated with small cone sizes. In this case the jet 
cross-sections for high p? and small cones simplify and analytical expression are rather 
manageable. At lowest non-trivial order, we have 



where Cj are the color factors associated with the representation of the particle initiating the 
jet. The contributions to the jet function from perturbative physics can be separated into 2 



that only connect the quark lines. These are congruent to the splitting function contributions 
to the cross-section and differ only by a normalization constant. 





(5) 





distinct components. As shown in Figure [TJ the first component constitutes of contributions 
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(a) ») (c) 

FIG. 1: Three seperate contributions to the Jet function in a general gauge 

The second contribution connects only to the Wilson lines, this component can be made 
identically zero by making convenient choice of gauge. It is also possible, in the case of 
light like wilson line to choose their directions to give a similar cancelation. The third is 
simply the components that can not be separated into the first or second types, and thus 
are associated with incoherent interference contributions to the jet cross-section and affect 
the distribution of radiation around the jet. This component can usually be ignored for 
heavy particles that are produced on-shell and can be expanded in 0(T p /m p ), where T p is 
the decay width of particle and m p its mass, e.g. the top quark or Higgs decaying into jets. 

Thus for studying of the background jets, i.e. light QCD jets, and their substructure one 
may need to include at least components (a) and (c), while if one would like to focus on the 
substructure of high pr heavy particles, i.e. their radiation pattern, to 0(T p /m p ) it's given 
completely by the splitting function contributions. 

One interesting feature at leading order, is that the jet mass distributions are independent 
of the properties of the other jets in the event. Therefore a measure of invariant mass 
distributions of each jet should provide a particular useful cut on the contamination of 



the light QCD jets in massive jet regions, such an observable is described in Section [TV 
Nonetheless such observables can be plagued by non-global logarithms, and further study of 
its high-order properties is warrant. Particularly since Monte Carlo studies [16J have shown 
a slight correlation between the leading jet mass and sub-leading jet mass. 

Beyond leading order one would need to include contributions from the soft function, 
whose definitions depend of the specific final states. In the case light QCD states it can be 
simply described by the scattering of Wilson lines, however these would need to be properly 
subtracted to avoid double counting present from the current definitions of the jet functions. 

Jet filtering methods described below associated with removing soft radiation from jets 



7 



with a large cone size, algorithmic go through subjets and place cut precisely on their 
contributions from these jet functions. Since for heavy particles the only contributions are 
associated with the splitting functions, analysis of the ratios of energies and distances of 
substructure relative to the jet can be well understood at the leading log level. 

C. Jet grooming methods 

The purpose of the three methods called Filtering [10] , Pruning [T7J [18] and Trimming 
[19] is to remove soft radiation from jets with a large cone size, e.g. R = 1.0. Radiation from 
the UE or pileup tends to be soft, especially compared to the decay products of a boosted 
heavy resonance, but it has a big effect on the jet's mass rrij, i.e. the UE contribution to 
Sm'j increases with R 4 . 

Filtering and Trimming denote similar procedures. The fat jet's constituents are recom- 
bined with a smaller R which results in several subjets. Only a subset of these subjets are 
kept, based on a specific criterium which is often a pr threshold. The criterium to discrim- 
inate between hard and soft is different for Filtering and Trimming. While Filtering keeps 
a fixed number n/ of the subjets, Trimming keeps only the subjets which fulfill 

Ptj > f x A, (7) 

where / is an adjustable parameter and A is an intrinsic scale of the fat jet, e.g. the fat-jet's 
transverse momentum. Note that although, / is usually the same for all jets in the event 
but A is not. Hence, the subjet pt threshold is different for every jet. 

Pruning is a method which removes radiation from UE and pileup while clustering the 
fat jet. The sequential jet algorithm continues until no pair of constituents has a 

Rij > D cut . D cut represents an effective subjet cone size for this algorithm and is usually 
associated with an intrinsic fat jet scale, e.g. D cut = mj/pxj- Now the protojet combination 
continues, but with an additional restriction: each pair of protojets that are due to be 
combined have to meet 

\PT,i+PT,j\ 

If z < z cnt the constituents i and j are not combined the one with smaller transverse 
momentum is discarded for the jet clustering. The algorithm continues until all constituents 
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have either been combined or else eliminated. 

For Trimming and Pruning the criteria to discriminate between FSR and UE/pileup 
adjust themselves based on quantities of the fat jet. To make Filtering work efficiently it is 
necessary to know the number of decay products of the resonance one seeks to reconstruct. 
Therefore Trimming and Pruning can be considered to be generic tagging algorithms for 
boosted resonances, particularly if other sources of hard QCD radiation are absent. 

It has been shown that all of these jet grooming techniques improve the jet's mass resolu- 
tion when compared to the raw jet mass [2J, e.g. the mass peak of a boosted top is narrower 
after applying grooming techniques. While the three methods perform very similarly in 
extracting the decay products of a hard resonance, they perform differently for QCD jets. 
This feature can be used to increase the significance of a resonance search [15]. As long 
as the cuts are place on the physical properties of the subjets, such as jet momentum and 
its IR safe jet observables, the Infrared Safety of these methods is congruent with the jet 
algorithm used to find the subjets. 

II. TECHNIQUES TO RECONSTRUCT BOOSTED RESONANCES 

A large set of resonance reconstruction techniques has been developed, most of which 
focus on either reconstructing two pronged, e.g. Higgs boson, or three pronged decays, e.g. 
top quark, while some techniques are applicable to both decay patterns, e.g. N-subjettiness 
or energy-flow methods. 

A. Two pronged decays 

Depending on the pt of the resonance, the cone size of the fat jet has to be chosen in such 
a way that the resonance's decay products are contained in the jet to allow for a successful 
reconstruction. As a rule of thumb, this can be achieved by choosing R ~ 2m/pT- Most 
subjet methods are based on the kx or CA algorithm. CA is a purely geometric algorithm 
which is not affected by the transverse momentum of the jets constituents, whereas the fcy 
algorithm has a tendency to start with combining the softest constituents first and finishes 
in the last step with subjets which are hard and far apart, see Eq.([T]). Therefore, the kx 
algorithm has a natural tendency to revert the shower process when recombining the jet. 
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While, the anti-fey algorithm provides a nice circularly shaped cone for the hardest jet in 
the event, it starts by combining hard jet constituents first, not following the naive shower 
picture. The anti-fey algorithm is therefore less suited for subjet methods than the k T or 
CA algorithms. 

In an early study by Seymour pQ it was shown that before detector effects were taken into 
account, clustering algorithms, a class that includes algorithms such as the fey algorithm, 
provided a much better reconstruction of the mass of a heavy boson decaying into the jet 
than conventional cone algorithms. A procedure now called "YSplitter" was used in |20l .'' ], 
in which the authors used the fey algorithm to study the dij distance in the final merging 
to reconstruct W bosons. QCD backgrounds are likely to have small d^ values whereas W 
jets tend to have values correlated with the mass of the W boson. 

Butterwoth, Davison, Rubin and Salam [10] proposed a method which combines filtering 
and a so-called procedure to reconstruct two pronged decaying resonances. This now goes 
by the name of "BDRS method," and it makes use of the purely geometric substructure 
clustering of the CA algorithm. They suggested to undo several steps of the jet cluster- 
ing until a substantial mass-drop emerges, e.g max(ji, j 2 )/^i,j 2 < A 4 ? with \i substantially 
smaller than 1, and the transverse momentum of the subjets is required to be balanced, 
e.g. mm(pTj 1 , pr,j 2 ) / max (PT,j 1 , Ptj 2 ) > Vent- Wide-angle UE and pileup radiation is actively 
removed in this process but the two resulting subjets after meeting the mass-drop condition 
might still contain uncorrelated soft radiation. This radiation is removed in a following step 
by applying Filtering on the constituents of the two subjets. The constituents are reclus- 
tered with CA, using R = min(0.3, ARj u j 2 /2) and only the three hardest of these subjets are 
taken. The third subjet is taken not to miss the soft wide-angle radiation off the decay prod- 
uct's color dipole. This method proofed to be successful in reconstructing the Higgs, Z and 
W bosons masses without introducing artificial scales in the QCD jets' mass distribution. 

Variations of the BDRS method have been exploited in studies with the purpose to 
reconstruct the Higgs boson in a busy final state, e.g. ttH [H] or SUSY cascade decays 
[2U[22]. In busy final states the two resulting subjets after meeting the mass drop condition 
might not be directly correlated to the decay products of the resonance. Therefore, it can 
be advantageous not to stop after the mass-drop condition is met for the first time, but to 
keep undoing the jet clustering while monitoring the structure of each individual mass drop. 
After going through the whole tree the two relevant subjets can be selected based on a well 
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suited subjet criterium. 



B. Three pronged decays 

The most prominent example of a three pronged decay used for subjet studies is the 
decay of a top quark: 

t W + b -»■ q'qb. (9) 

Initially studying boosted tops was motivated by the potential decay of a very heavy reso- 
nance {m x ~ 0(1) TeV) into tops [23H26] . leading to highly collimated three-pronged decays 
(PT.top > 500 GeV). The two tops recoil against each other and the reconstruction of the 
heavy resonance, e.g. a Z' or K K-Ghion, is jeopardized because the top decay products can 
overlap and have to overcome large QCD dijet or W^+jets backgrounds. Jet substructure 
methods are unavoidable under these circumstances. The fact that the decay products of 
the two tops are well separated and the knowledge of the top and W mass can be used to 
develop so-called top taggers with good reconstruction and background rejection efficiency 



Often top taggers are inspired by the ideas outlined in Sec. (II A). The performance of several 
of the taggers described in this section were compared in [2]. 

Thaler and Wang [27J utilize a similar approach. A jet is reclustered with the kx algo- 
rithm, until two or three subjets are formed. Internal kinematic variables in addition to kx 
scales are combined to disentangle the top jet from the background. For example, in the 
three-subjet analysis, a W boson candidate is identified by forming the minimum pairwise 
mass between subjets and a minimum cut is placed on its mass. Relative energy sharings 
between the subjets were also studied. 

A top tagger early applied by ATLAS [28, 29J is an extension of the "YSplitter". Using 
the kx algorithm, the djj values of the next-to-last and next-to-next-to-last clusterings are 
combined with the jet's mass in a multidimensional space, which yields enough information 
to apply cuts or a likelihood ratio to discriminate between top jets and QCD jets. The 
tagger performs particularly well for large tagging efficiencies [2] . 

The "Hopkins" top tagger is based on the mass-drop procedure of the BDRS study 
[301 EI]- The fat jet is clustered using the CA algorithm. This jet is then decomposed 
by reversing the cluster history until a branching is found, such that the Pt of the subjets 
is balanced and the distance between the two subjets is not too small. This procedure 
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is repeated on the two subjets. A jet is considered a top-jet candidate if after the first 
mass-drop at least one more mass-drop was found. On the resulting 3 or 4 subjets several 
kinematic cuts are aplied: The invariant mass of the all subjets should lie near m t , two of 
the subjets are required to reconstruct new mw and their helicity angle should not be too 
small. 

The "Hopkins" top tagger has been modified by CMS |32j, where the W boson mass 
window cut and the helicity angle cut have been replaced by a single cut on the minimum 
pairwise subjet mass. 

All of these taggers are focusing on reconstructing top quarks with large transverse mo- 
mentum, therefore none of these taggers applies a jet grooming procedure like Filtering, 
Pruning or Trimming, which makes them more sensitive to the fat jet's cone size. 

The Heidelberg-Eugene-Paris (HEP) top tagger [T4"t 133] was designed to work in the 
medium p? region for busy and non-busy final states, i.e. with very large jet radius R = 1.5. 
Inspired by the BDRS method, the tagger uses a combination of mass drops and Filtering. 
While branching through the whole jet's recombination history down to a subjet-mass scale 
of 30 GeV. In every merging very soft subjets are discarded. To select the decay products of 
the top quark a minimization procedure of the remaining hard substructure is applied. Every 
set of constituents of the three subjets is reclustered with CA, R = min(0.3, (ARij/2)) and 
rif = 5. The set of three initial subjets that gives a Filtered mass closest to the top mass is 
retained as top candidate. This top candidate is reclustered again to yield exactly 3 subjets, 
which are the anticipated decay products of the top quark. The invariant-mass combinations 
of the subjets are placed in the two-dimensional subspace of (m23/mi23, arctan(mi3/mi2)). 
For tops, one of the combinations is required to satisfy m^/m^ — m^jm t . To further 
increase the efficiency of the tagger, dipolarity [3J] was proposed to single out the decay 
products of the W. 

Three pronged decays can also occur in searches of new physics. In [33], the authors 
explore two tagging methods in R-parity-violating neutralino decays. One method is sim- 
ilar to the YSplitter, cutting on dimensionless ratios as d^/m 2 . The other method is CA 
based and searches the entire cluster history of the fat jet. For every merging which is not 
too asymmetric in px the Jade-distance PT1PT2AR-12 is recorded. The clustering with the 
largest Jade-distance defines the neutralino candidate and to ensure 3-body kinematics, a 
cut is placed on the ratio of the masses of the subjets with second-largest and largest Jade 
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distances. 



C. Jet-shape and energy-flow methods 

Infrared safety of observables guarantees that we can make meaningful comparisons be- 
tween theoretical computations and experimental measurements. Such observables should 
not probe regions directly associated with infrared singularities of the theory, since within 
these regions the perturbative cross-sections are ill-defined. These regions are associated with 
collinear and soft degrees of freedom, therefore infrared-safe observables should be insensi- 
tive to collinear splittings of states or additional soft momenta. These observables should 
be insensitive to non-perturbative physics, since these are the same regions are associated 
with long distance effects that are not a priori computable in the framework perturbative 
QCD. 

Such observables can be built from information of the states within the jet. We call 
such IR-safe observables, jet shapes, since they should be sensitive to the distribution of 
energy and momenta within the jet. There have been a variety of jet shapes introduced, 
some are simply applications of previous event-wide observables, i.e. event-shapes, applied 
to the subset of event states that clustered into a jet. Therefore they must be defined for 
any number of states that may define the jet. 

One such observable used in the identification of massive jets is the so-called Planar flow 
proposed in (271 [361 EZJ- For example, in (361 EI], one begins by defining a tensor I w that 
depends on the energy of the states defining the jet and their momentum transverse to the 
direction of the jet momentum, p i k . 

£ = —£«•—-■ do) 

One can then construct an IR-safe observable from the eigenvalues of this tensor, such as 
the one defined in [361 EZ] > 

P f = 4 det/ " . (11) 

The interesting the feature of this observable is that it's zero for any linear distributions 
in the plane transverse to the jet axis. Therefore, if the jet decay is dominated by single 
emission from the particle initiating the jet, then its Planar flow distribution should peak 
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around zero. Infrared safety of this observable makes it less sensitive to hadronization effects, 
and therefore perturbative features such as low planar flow are expressed at the hadron level. 

It was first used in distinguishing high-p^ top decays as in Eq. [9j where the background, 
high mass light-QCD jets, is dominated by single gluon emission. Therefore it greatly 
differentiates light-QCD jets from top jets, whose decay chain leads to a flat Planar flow 
distribution. 

Another use of Planar flow in [38H4"0] where it was used in a contrasting manner, not to 
find background events with low planar flow as above, but to find signal events with lower 
planar flow than the background. If we are interested in looking for color neutral jets, then 
most of the radiation is exchanged among the subsequent QCD final states in the decay 
chain, since at the level of leading color the radiation is color blind to rest of the event. 
Angular ordering of soft gluon emission ensures that more radiation off a color dipole can be 
found between the color connected partners |41j . This leads to a linear distribution of high 
final states in-between two hard events within the jet. Therefore, Planar flow distribution 
for electro-weak boson jets, or other exotic color neutral jets, will peak at lower planar 
flows then QCD jets. The method should be particular useful for high— regions where 
light-QCD jets are contaminated by extra radiation, i.e. where single gluon emission doesn't 
dominate. 

The use of the energy distribution between two jets or subjets to distinguish the color of 
the final states in an event was discussed in (121 US]- In [33] an observable called "pull" was 
developed, it makes use of the gradient of the transverse energy distribution for each of jets in 
an event. The angle between the gradients provides a measure of the color pertaining to the 
original state that decayed into the two jets (or subjets). This observable has shown great 
promise in eluding the color structure of not only in jets themselves but also event-wide. 

Another method to distinguished heavy jets using their energy distributions was intro- 
duced in [ID]- The authors began by trying to obtain the utmost distinction of two jets by 
the difference in their energy distributions, by introducing a method to match their IR-safe 
energy distributions between fixed order energy flow calculations and the measured energy 
distributions. The measured energy distributions can be match to a set of fixed templates, 
which describe the kinematical information from signal or background. It proved particu- 
larly useful when combining with cuts in jet shapes, such as Planar flow, or in the kinematics 
of the matching set of templates. The method itself is versatile enough to work for a range 
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of processes, it is particular useful for events where the energy distribution is all that is 
available. 

Recently, N-Jettiness has been introduced, an event shape observable which is designed 
specifically to facilitate the theoretical description of exclusive N-jet final states. Based 
on jB], in jl5] and H7] the approach was adapted to construct a jet shape observable 
N-subjettiness tn. tn is calculated by minimizing 

tn = T~y~l Pr,k min {AR hh ■ ■ ■ AR Ntk } , (12) 
d ° k 

with 

do = ^p T ,kR- (13) 

k 

Here the index k runs over all constituents of the fat jet with cone size R. By minimizing rjy, 
N axes of candidate subjets are constructed, tn vanishes if the number of subjet candidates 
coincides with the number of jet constituents. Therefore, can be used to measure the 
number of hard isolated energy deposits inside a fat jet which can be helpful to disentangle 
resonance jets from QCD jets. To ameliorate the effect of hard uncorrelated radiation in the 
fat jet on tn, it can be advantageous to consider t^/tk, with K ^ N. 

Looking for mass-drops while branching backwards through the jet recombination tree is 
a recurring motif in many resonance reconstruction techniques. An approach avoiding trees 
was proposed in [IB]. The angular correlation function 

g{R) = ZW^T, l3 \ ^ 

^i&PTiPTjARfj 

where the sum runs over all pairs of constituents of the jet and Q(x) is the Heavi- 
side step function, provides a profile of the jet radiation. The variation of Q with R, 
d\ogQ{R)/d\ogR > 0, yields a topographic profile of the jet. In jlH] the hight and number 
of peaks is used tag top-quark jets. 



D. Leptonic top quark reconstruction 

On the one hand, the lepton, the b-jet and the sizable amount of missing transverse 
energy provide strong handles to suppress backgrounds if the signal contains a leptonic 
top. On the other hand, the invisible neutrino momentum and the difficulty of measuring 
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the neutrino's transverse energy precisely, hampers the top quark's reconstruction. If the 
top quark is boosted to very large px, standard lepton isolation criteria reduce the signal 
efficiency and b-tagging can be severely degraded. Using a "mini-isolation" cut [IH] at the 
tracker level for muons is a tempting option to reject a large fraction of background events 
while retaining a good signal reconstruction efficiency. Alternatives to lepton isolation cuts 
have been proposed in [27] making use of kinematic correlations of the top decay products. If 
the lepton is isolated, knowing the neutrino momentum ensures a good reconstruction of the 
top's momentum. In many searches the neutrino momentum is reconstructed by assuming 
that all missing transverse energy origins in the neutrino momentum, in which case the 
neutrino can be reconstructed by requiring m v i = mw- I n [HD] a novel way of reconstructing 
the neutrino momentum was introduced, using only the lepton and b-jet momenta. By not 
relying on the missing transverse energy vector the method is less sensitive to systematic 
uncertainties and can be applied to searches where more sources of MET are present, e.g. 
pp it -^tt + MET. 

E. Maximum information approaches 

The outlined jet substructure methods apply different approaches and ideas to disentangle 
the sought hard substructure associated with the hard interaction of the event from QCD 
backgrounds. Hence, the different methods might extract complementary information with 
respect to each other and it can be beneficial to combine them to obtain a better signal 
purity or background rejection |15j . 

In [51] the authors combined many jet and event shape observables to improve the signif- 
icance for the HV search with H — > bb at the Tevatron. These observables were processed 
and analyzed using Boosted Decision Trees to increase the statistical significance over cut 
based studies. Systematic uncertainties are difficult to asses, but a significance increase of 
10 — 20% was found compared to the variables used by the CDF and DO collaborations. 
A similar approach was used for the analysis of boosted W in |52j, where grooming tech- 
niques and jet substructure observables like R-cores were combined to improve on the BDRS 
method. After processing them through a Boosted Decision Tree, a significance improvement 
for highly boosted W over the BDRS method was found. 

The method "Shower deconstruction" [53J aims to be a maximum information approach 
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by acting on the smallest practical pieces of radiation in an event, e.g. topo-clusters, 
calorimeter towers or just small jets. Shower deconstruction then assigns to each event 
a number obtained from an analytic function that is an estimate of the ratio of the probabil- 
ity for a signal process to produce that event to the probability for a background process to 
produce that event. The analytic function to calculate these probabilities is based on first 
principle QCD and mimics what full event generators do. The method was applied to the 
HZ final state and was found to improve on the BDRS study in this channel. 

III. SEARCHES AND APPLICATIONS 

The existence of boosted resonances is a kinematic necessity if electroweak scale physics is 
probed at a multi-TeV collider. The reconstruction of boosted electroweak scale objects can 
be useful in a plethora of searches for new physics. Thus, the jet substructure tools outlined 
in SecjTTJhave been used and developed for this purpose. Most new physics scenarios propose 
heavy particles which decay subsequently in electroweak scale particles. Therefore, searches 
which make use of jet substructure can provide a superior way to discover new physics by 
reconstructing boosted particles like electroweak gauge bosons, the Higgs boson or the top 
quark. 

A. Boosted electroweak gauge bosons 

The first applications of jet substructure techniques were focusing on the reconstruction 
of boosted hadronically decaying gauge bosons [20J. Studying longitudinal vector boson 
scattering can provide insights into the nature of electroweak symmetry breaking, especially 
if no Higgs boson is found. For this purpose, the vector boson fusion process, where two 
electroweak gauge bosons are produced in association with two tagging jets, is particularly 
suited. Standard searches rely on leptonic boson decays. Allowing them to decay hadron- 
ically can increase the sensitivity of the study. In [HI] this was further treated using the 
polarization of the vector bosons. 

SUSY cascade decays are a source of boosted electroweak gauge bosons. In [35] their 
reconstruction was used to obtain information about masses and branching ratios of SUSY 
particles. 
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A heavy Higgs boson (m^ > 300 GeV) decays most of the time into gauge bosons. Its 
main discovery channel, the so-called "gold plated mode" , relies on two boosted leptonically 
decaying Z bosons. This channel has only small backgrounds and provides a very clean 
signature with a good S/B. However, using the BDRS method improved by a combination 
of Trimming and Pruning [15] , even the semihadronic channel was found to be of comparable 
significance as the purely leptonic channel, despite its much bigger backgrounds [55]. The 
semihadronic channel also provides sensitivity to the CP property of a heavy scalar resonance 
[ST)] . Generically, the decay of very heavy resonances, e.g. Z' , can result in highly boosted 
Higgs boson or electroweak gauge boson which have a sizable branching ratio into bottom 
quarks or taus. This scenario is discussed in [53, |58] ■ 



B. Boosted Higgs bosons 

Hadronic decays of the Higgs boson, e.g. h — > bb, have played an important role in 
searches at the Tevatron. At the LHC, due to an increased Higgs-production cross section 
and a simpler reconstruction, most of the searches rely on decays to gauge bosons or leptons. 
The Higgs boson decays into a collimated bb pair with a large branching fraction. The 



analysis [TTJ] ITT] makes use of the BDRS method outlined in Sec II A It demonstrates that 
the VH production channel can be a viable candidate for the discovery of a light Higgs boson 
at the LHC. The ATLAS collaboration reproduced this result in a full-detector simulation, 
finding S/y/B ~ 3.7 after C = 30 fb _1 considering only statistical errors. 

In early ATLAS and CMS reports [59H6T] the tiH production channel with subsequent 
Higgs decay to bottom quarks was one of the major discovery channels for a light Higgs bo- 
son. Further studies revealed a very poor signal-to-background ratio of 1/9 [U2], making the 
channel very sensitive to systematic uncertainties which might prevent it from reaching a 5a 
significance for any luminosity. However, at high transverse momentum, after reconstructing 
the boosted, hadronically decaying top quark using the HEPTopTagger as well as the Higgs 
boson with a modified version of the BDRS method, and requiring 3 b-tags, the signal-to- 
background ratio can be improved to ~ 1/2, while keeping the statistical significance at a 
similar value to that in Ref. [52] . 

If the scalar sector is more complex than in the SM predicted, the Higgs boson can decay 
into light pseudo-scalars a and possibly evade constraints from the LEP experiments. If 
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m a <C nih the light scalars are boosted and the decays to gluons [381 EH], light quarks or 
taus [U3] can dominate. All of these studies exploit that the radiation pattern of a color 
singlet state differs from QCD backgrounds. They conclude that the Higgs boson can be 
reconstructed with as little as C = 20 — 100 fb" 1 . The lightest CP-even Higgs boson of the 
MSSM might even be discovered earlier than in any other channel, if searched for in Squark 
or Gluino cascade decays [2T| 122] . 

C. Boosted top quarks 

The need to be able to reconstruct boosted tops from a heavy resonance [23H26] or top 
partner decay [331 E01 El] has motivated and promoted the field of jet substructure like 
anything else. Top quarks from heavy resonances, e.g. Z' — > ti, are naturally boosted with 
PT,t ^ 500 GeV. While top quarks produced around threshold yield well separated decay 
products and can be efficiently discriminated from QCD jets, at a first glance, a highly 
boosted top looks like one single massive jets. Jet substructure methods are essential to 
discriminate between QCD and top jet. Because of the technical difficulties in reconstructing 
boosted tops a lot of work has been devoted to developing tools, whereas the number of 
papers applying these tools to new physics searches is still relatively small - yet growing. 

The Hopkins top tagger has been used in phenomenological studies reconstructing KK 
vector boson resonances [65] and full detector simulation studies have been performed by 
CMS with a slightly modified implementation [32]. For the reconstruction of a heavy Z' 
[66] between 1-4 TeV they find a good background rejection and signal efficiency. Using the 
"Hopkins" top tagger can increase the sensitivity for Z' with m Z t > 2 TeV compared to 
standard reconstruction techniques. 

Because scalar top partners can ameliorate the top quark's impact to the hierarchy prob- 
lem of the Higgs, they are among the most anticipated particles to be found at the LHC 
[67] . The HEPTop Tagger was applied [33J to reconstruct the light top squark of the MSSM 
in a final-state with only jets and missing transverse energy, pp — > tt — > ttxiXv While a 
reconstruction with standard techniques yields S/B ~ 1/7, a subjet analysis in combination 
with m T2 [SSI ES] can result in S/B ~ 0.88 and S/yB ~ 6 after 10 fb _1 . If one of the tops 
decays leptonically a leptonic top tagger can be used to separate the neutrino's MET con- 
tribution from the neutralinos' MET, which allows an effective use of vrtTi- The significance 
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in the channel with semileptonic top decays is similar to the all-hadronic channel |50j . 

The top can be produced from a top partner decay in association with a Higgs boson. 
Even if the top and the Higgs are not highly boosted they can be reconstructed using the 
BDRS approach and the HEPTopTagger, with only 10 fb™ 1 at 14 TeV [64J. 

IV. MEASUREMENTS 

In this section we discuss the experimental measurements of boosted jets at CDF. Ex- 
perimental studies at the Tevatron were limited to jets with px < 400 GeV (TOj [71]. Results 
on jets with higher pt produced at the LHC have been published recently [72] . 

The CDF study presented here [731 HI] was the first to look at substructure of massive 
jets with pj- > 400 Gev. Both Midpoint and anti-fcr jet algorithms were used, with a typical 
jet size of R = 0.4,0.7, and 1.0. A pileup correction technique was used to remove contri- 
bution of non-coherent energy deposits, critical in substructure studies. Three observables 
were measured. The jet mass was compared to the theoretical next-to-leading order (NLO) 
prediction of the jet function [37]. Angularity and planar flow [36] served as further support 
to the hypothesis of the two-prongness of QCD jets. Finally, a boosted top search [75] based 
mostly on the jet mass will be reviewed. 

A. Pileup Correction 

Pileup is the result of additional collisions in the same bunch crossing. The number 
of reconstructed vertices, N vtx , is a good estimate for the number of multiple interactions. 
Its average in the studied data was ~ 3. Pileup creates additional energy deposits in the 
detector which are crucial for substructure studies. This problem becomes more severe at 
the large luminosities expected at the LHC. Moreover, the pileup in the MC samples used 
is not necessarily modeled exactly as the data. Therefore a data driven technique was used 
to measure the shift in various observables due to additional incoherent energy in the jet 
and correct for it [731 ESI [77] . A complementary cone, one with the same size parameter 
R, is set at 90° in azimuth away from the leading jet. The energy deposits inside this cone 
are rotated back and added to the jet. Then the shift in the interesting observables can be 
measured as a function of the value of these observables. Due to the large event-by-event 
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fluctuation in this energy, the mean shift was used for correction instead of an event-by- 
event value. Furthermore, since the underlying event contributes some coherent energy to 
the final state, its contribution was separated from that of the multiple interactions by the 
following method: the shift from single-vertex events (assumed to contain only underlying 
event contributions) was subtracted from the shift in multi-vertex events (containing both 
underlying event as well as multiple interaction contributions). 

This technique was implemented on the jet mass, reducing the systematic shift in the jet 
mass distribution due to pileup. Finally, an analytic calculation for deriving an approxima- 
tion to these shifts in the observables is presented in [76J, and the relation with the concept 
of jet area is discussed. 

B. Jet Mass 

The jet function [37] predicts both the shape of the jet mass distribution, given its 
functional form, as well as the absolute normalization. In the reviewed study, an NLO 
approximation was used, valid for jet masses above the low mass peak and below the value 
of R ■ pt where R is the typical size used to associate particles to a jet. Higher-order 
corrections were estimated to be ~ 30%. 

Fig. [2] shows a comparison of the jet mass distribution for a cone size R = 0.7 with the 
analytic predictions for the jet function. To conform to the validity range of the jet function, 
this comparison is made for jet masses above 70 Gev 2 and up to 280 Gev 2 . The analytical 
prediction for quark jets describes approximately the shape of the distribution and fraction 
of jets but tends to over-estimate the rate for jet masses from 130 to 200 Gev 2 . pQCD 
indeed predicts that ~ 80% of these jets arise from quarks [75] . 

The data and the PYTHIA distributions are in reasonable agreement. Furthermore, as 
shown in the inset plot, a good agreement is observed between the Midpoint and the anti-fcr 
algorithms. Since the jet mass is and NLO effect (a gluon emission is needed) this is very 
interesting considering the fact that Midpoint is shown to be IR3+1 [79J whereas anti-fcy is 
IR-safe. 
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CDF Run II, L. = 6 fb 




m iet1 [GeV/c 2 ] 



FIG. 2: The normalized jet mass distribution for Midpoint jets with > 400 Gev and 
\rj\ G (0.1,0.7). The uncertainties shown are statistical (black lines) and systematic (yellow 
bars). The theory predictions for the jet function for quarks and gluons are shown as solid 

curves and have an estimated uncertainty of ~ 30%. Also shown is the PYTHIA MC 
prediction (red dashed line). The inset compares Midpoint (full black circles) and anti-fcr 

(open green squares) jets [73] 

C. Angularity 

Angularity was shown to qualitatively distinguish between QCD jets and other two-body 
decays [3Bj . The reviewed study showed that the shape of the angularity distribution agrees 
with the two-prong description of high p? massive QCD jets, an assumption that the jet 
function is also based upon. This agreement is manifested in the two kinematical limits: 
r™ 2 m ~ (rrij /2pTj) 3 is obtained from decay configurations in which both daughter particles 
are emitted at the same angle with respect to the direction of the mother particle and have 
the same energy. r™ 2 a:r ~ 2~ 3 R 2 rrij/pT,j is obtained when one of the decay daughters is hard 
and almost collinear with the mother particle, whereas the second decay daughter is soft 
and emitted at a large angle, limited by the size parameter R. 

Fig. [3] shows that the angularity distribution indeed lies between the two expected limits. 
A good agreement is observed between the data and the PYTHIA sample. Furthermore, as 
in the jet mass case, an agreement is observed between the Midpoint results and those of 
anti-A/r- 
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FIG. 3: The angularity distribution for Midpoint jets with px > 400 Gev, \r)\ G (0.1,0.7), 
and m jetl G (90, 120) Gev 2 . Also shown are the PYTHIA calculation (red dashed line) and 
the pQCD kinematic endpoints. The inset compares the distributions for Midpoint (full 
black circles) and anti-k T (open green squares) jets 



D. Planar Flow 



Planar flow [361 EZ] describes the way energy is deposited on the plane perpendicular 
to the jet axis. Due to the two-prong-nature of massive boosted QCD jets, their energetic 
signature in this plane is expected to be composed of two energy deposits, lying on a single 
line. This ideal configuration has a planar flow value of zero. The always existing soft 
contribution inside the jet shifts the most probable planer flow to a value somewhat higher 
than zero. 

On the other hand, consider a hadronic decay of a boosted top quark, in which the b 
quark and the qq pair are clustered together in a single jet. The three-pronged energy 
signature, with varying distances and energy distribution will yield a rather uniform planar 
flow distribution. This makes planar flow a good handle for separating two-prong decays 
from three-prong decays. 

Fig. [4] shows the planar flow distributions for jets in a mass window of 130 to 210 Gev 2 , 
a range relevant for searches of boosted tops. The data exhibits the expected QCD like 
behavior and peaks at a low planar flow value. This should be compared to the ti MC 
sample that exhibits a flatter distribution. Once again the two jet algorithms show good 
agreement. This is particularly interesting in the case of planar flow which deals with three- 
body configurations and considering the IR3+1 ranking of Midpoint. 
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FIG. 4: The planar flow distributions for Midpoint jets with pt > 400 Gev, \t)\ G (0.1,0.7), 
and mP etl G (130,210) Gev 2 . Also shown are the pythia QCD (red dashed line) and ti 
(blue dotted line) jets, as well as the results from the two jet algorithms (inset). All 
distributions have been separately normalized to unity 



E. Search for Boosted Tops 



Given the understanding of the jet substructure from the reviewed study, and in particular 
the jet mass, a technique was developed to separate the signal of boosted tops from the 
primary background to this search, which come from the production of massive QCD jets. 
Furthermore, a data-driven technique to estimate this background was used. 

In QCD, no dependence is expected in principle between the masses of the two leading 
jets in a typical dijet event. On the other hand, in all hadronic ti decays, one expects two 
massive jets recoiling one against each other. This is shown for the data and ti MC sample 
in Fig. |5j An upper cut on the significance of the missing energy was applied to reject 
semileptonic events. 

The signal region, D, was defined to include two massive jets in the range of 130 to 210 
Gev 2 . The definitions of the control regions and the number of counted events in each region 
is given in Table [TJ The table also shows the expected number of ti events in each region. 
The prediction shown in the table is given by: 



wS - = (15) 

JV Admass 

where Nx is the number of events in region X and R ma ss = (NbNc) / (NaNd) was introduced 
in [16] and represents correlations that might exist between the m? et<1 and m? etx distributions. 
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A value calculated using POWHEG [80H83] of R ma ss = 0.89 was used in this study. 
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FIG. 5: The m? et2 versus m? etx distribution for data (a) and tt MC (b) events with at least 
one jet with px > 400 Gev and \r]\ < 0.7 using R = 1.0 Midpoint cones. All events are 

required to have Smet < 4 [75] 



In QCD events, no correlation is expected between the missing energy, which comes 
mostly from instrumental effects, and the leading jet mass. In semileptonic tt events, one 
expects a leading massive jet along with large missing energy. This is shown (using the 
missing energy significance) in Fig. [6] for data and tt MC. The results are shown in Table [ll| 
as well as a prediction for the signal region obtained in a similar fashion to the prediction 
for the all hadronic case, omitting the R maS s factor. 

A-priori a comparable signal-to-noise and acceptance in the all-hadronic and lepton+jets 
channels were expected. Thus the two channels were combined and used to set an upper 
limit on Standard Model tt production for top quarks with pf > 400 Gev. A 95% C.L. 
limit was calculated, folding in the systematic uncertainties, using a Bayesian approach 
employing a flat prior on the cross section and treating the sources of systematic uncertainty 
as nuisance parameters [SU ES]. The resulting upper limit is 38 fb at 95% C.L.. This is 
approximately an order of magnitude higher than the estimated Standard Model rate, and 
is limited by the QCD background rates. It is, however, the most stringent limit on boosted 
top quark production to date. Furthermore, the "expected limit" was calculated by using 
the background estimated from the data-driven technique and assuming an observation of tt 
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Region 


m jetl 


m jet2 


Data 


tt MC 




(Gev 2 ) 


(Gev 2 ) 


(Events) 


(Events) 


A 


(30, 50) 


(30,50) 


370 


0.00 


B 


(130,210) 


(30,50) 


47 


0.08 


C 


(30,50) 


(130,210) 


102 


0.01 


D (signal) 


(130,210) 


(130,210) 


31 


3.03 


Predicted QCD in D 






14.6 ±2.76 





TABLE I: The observed number of events in the three control regions used to predict the 
background rate in the signal region (region D). The tt MC event rates in each region are 

also shown [75] 




(a) Data (b) tt MC 

FIG. 6: The Smet versus m? etx distribution for data (a) and tt MC (b) events with at least 
one jet with px > 400 Gev 2 and \r]\ < 0.7 using R = 1.0 Midpoint cones [75] 

events at the expected level of 4.9 events. The upper limit is 33 fb at 95% C.L., which is lower 
than the observed limit since an excess of events above the expected signal plus background 
is observed in the data. It is interesting to set a limit on the fully hadronic channel, as this 
creates a selection that is sensitive to pair production of two massive objects near the mass 
of the top quark. As the main interest in this case is in beyond-SM contributions to this 
final state, the background estimate the for the expected tt contribution of 3 ± 0.8 events is 
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Region 


m jetl 


Smet 


Data 


MC 




(Gev 2 ) 


(VGev 2 ) 


(Events) 


(Events) 


A 


(30,50) 


(2,3) 


256 


0.01 


B 


(130,210) 


(2,3) 


42 


1.07 


C 


(30,50) 


(4,10) 


191 


0.03 


D (signal) 


(130,210) 


(4,10) 


26 


1.90 


Predicted QCD in D 






31.3 ±8.1 





TABLE II: The observed number of events in the three control regions used to predict the 
background rate in the signal region (Region D) for high Smet and high m jeil . The ti 
MC event rates are also shown normalized to the sensitivity of the data sample [75] 

included. Taking out the top quark hadronic branching fraction of 4/9, the upper limit is 
20 fb at 95% C.L. 

V. CONCLUSIONS 

If beyond standard model physics is present it should reveal itself in jet rich events. 
Understanding the substructure of such jets provides us with a handle in the analysis and 
differentiation of the possible models describing the events at current and future colliders. 
Any machine probing the multi-TeV scale will produce electroweak scale resonances which 
will be highly boosted. The methods we describe above should guide us and contribute 
in disentangling these resonances from overwhelming QCD background. Currently, these 
methods are being implemented on data to test their validity, and one such example is the 
CDF measurement. At CDF the jet mass, angularity, and planar flow were measured for 
the first time for jets with px > 400 Gev, using both Midpoint and anti-£>r jet algorithms. 
A data-driven pileup correction technique was developed and used. A good agreement 
between the data and PYTHIA MC predictions was observed. The jet mass distribution 
for the data was consistent with the NLO QCD jet function at large mass. Angularity and 
planar flow show that the data follows the two-prong assumption for boosted massive QCD 



27 



jets. And interesting agreement between the two studied jet algorithms is observed. Finally, 
a boosted top quark search based mainly on jet mas yielded an upper limit on the cross 
section of Standard Model tt production at high p T . 
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